Philosophy of IR Evaluation

نویسنده

  • Ellen M. Voorhees
چکیده

• System evaluation: how good are document rankings? • User-based evaluation: how satisfied is user? NIST Why do system evaluation? • Allows sufficient control of variables to increase power of comparative experiments – laboratory tests less expensive – laboratory tests more diagnostic – laboratory tests necessarily an abstraction • It works! – numerous examples of techniques developed in the laboratory that improve performance in operational settings NIST Cranfield Tradition • Laboratory testing of retrieval systems first done in Cranfield II experiment (1963) – fixed document and query sets – evaluation based on relevance judgments – relevance abstracted to topical similarity • Test collections – set of documents – set of questions – relevance judgments NIST Cranfield Tradition Assumptions • Relevance can be approximated by topical similarity – relevance of one doc is independent of others – all relevant documents equally desirable – user information need doesn't change • Single set of judgments is representative of user population • Complete judgments (i.e., recall is knowable) • [Binary judgments] NIST The Case Against the Cranfield Tradition • Relevance judgments – vary too much to be the basis of evaluation – topical similarity is not utility – static set of judgments cannot reflect user's changing information need • Recall is unknowable • Results on test collections are not representative of operational retrieval systems

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Meaning in philosophy and meaning in information retrieval (IR)

Purpose -The paper explores the question of whether the differences between meaning in philosophy and meaning in information retrieval (IR) have implications for the use of philosophy in supporting research in IR. Design/methodology/approach Conceptual analysis and literature review. Findings There are some differences in the role of meaning in terms of purpose, content and use which should be ...

متن کامل

Zarathrustrian Mind: Some Comparative Reflections on the Philosophy of Zarathrustra

This paper deals with an essential problem which the modern western thinker faced with and tried to find a solution for that in the benefit of modern humanity. This problem is human reason and his free mind. The author tries here to go back to Zarathrustrian concept of mind and bring forth some fresh reflections in a comparative way. This will let him to evaluate in the main the view that argue...

متن کامل

NTCIREVAL: A Generic Toolkit for Information Access Evaluation

Over the past decades, Information Access (IA) tasks have evolved and diversified. For example, in the mid20th century, Information Retrieval (IR) was about set retrieval for libraries; then with the advent of the digital information overload era, ranked retrieval became a necessity; now in the 21st century, we are experiencing richer forms of IR such as diversified Web search in order to satis...

متن کامل

لذت بیماری در پرتو حکمت بیماری از منظر صحیفه سجّادیّه

Background and Objective:  Pain and sickness are undeniable facts in the material life. This study was aimed to evaluate Imam Sadjad's points of view about sickness. We would like to know if sickness can be an enjoyable or disgusting phenomenon.  Materials and Methods:  With respect to the objectives of our study, research material comprised of Imam Sadjad trainings and Sahife Sajadieh worships...

متن کامل

Comparative Evaluation of the Efficacy of Wisdom and Inspiration in Abu Hatam’s and Zakariyya Razi’s Opinions

The question has always been raised throughout history whether human beings need to follow the prophets or divine revelation to achieve salvation. There will be no need, as some believe, to follow divine teachings once human beings reach intellectual maturity; however, others insist on the permanent need for Guidance from God due to inadequacy of human reason. In the third or fourth century AH,...

متن کامل

Evaluation the protective effects of doxycycline on acetaminophen-induced hepatotoxicity in mice

Acetaminophen (APAP) toxicity threatens human health due to increased mortality associated with its overdose. Doxycycline (DC) because of its properties such as antioxidant and anti-inflammatory can be a good therapeutic strategy to treat the acute toxicity induced by APAP. Male mice were divided to six groups in two periods of 3 and 24-h as normal saline, APAP 400 mg/kg, DC 100 mg/kg and group...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2001